Overview

Dataset statistics

Number of variables: 28
Number of observations: 118
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 25.9 KiB
Average record size in memory: 225.1 B

Variable types

NUM: 18
BOOL: 6
CAT: 4

Reproduction

Analysis started: 2020-05-14 08:35:08.545512
Analysis finished: 2020-05-14 08:36:07.927497
Duration: 59.38 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

numSub is highly correlated with numPal and 2 other fields (High correlation)
numPal is highly correlated with numSub and 3 other fields (High correlation)
numVrb is highly correlated with numPal and 2 other fields (High correlation)
numDet is highly correlated with numPal and 3 other fields (High correlation)
numAdv is highly correlated with numVrb (High correlation)
numAdp is highly correlated with numPal and 2 other fields (High correlation)
Post has unique values (Unique)
Links I. has 98 (83.1%) zeros (Zeros)
Links E. has 6 (5.1%) zeros (Zeros)
numNum has 8 (6.8%) zeros (Zeros)

Variables

Post
Categorical

UNIQUE

Distinct count: 118
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 944.0 B

Common values

Value | Count | Frequency (%)
Por que café descafeinado tem gosto e aroma de café? | 1 | 0.8%
Crônica de uma tragédia anunciada: a morte assistida do Museu Nacional | 1 | 0.8%
[ENTREVISTA] Água em Marte: os próximos passos para a pesquisa espacial | 1 | 0.8%
O que tem a ver um tear com a era dos computadores? | 1 | 0.8%
Turma da Mônica e a ciência | 1 | 0.8%
[RETROSPECTIVA] 10 fatos do mundo da ciência em 2018 | 1 | 0.8%
Pílula anticoncepcional para o homem: é uma realidade? | 1 | 0.8%
Ostrava: cultura, história, ciência e tecnologia em uma só cidade | 1 | 0.8%
Resenha – O fim da eternidade – Isaac Asimov | 1 | 0.8%
10 momentos da ciência em 2019 | 1 | 0.8%
Other values (108) | 108 | 91.5%

Length

Max length: 91
Median length: 38
Mean length: 40.77118644
Min length: 7

Categoria
Categorical

Distinct count: 8
Unique (%): 6.8%
Missing: 0
Missing (%): 0.0%
Memory size: 944.0 B

Common values

Value | Count | Frequency (%)
Ciência ao redor | 43 | 36.4%
O que que a ciência tem? | 36 | 30.5%
Profissão Cientista | 13 | 11.0%
Sci… what? | 7 | 5.9%
Outros | 6 | 5.1%
Ciência Pop | 6 | 5.1%
ABC da ciência | 4 | 3.4%
Você disse ciência? | 3 | 2.5%

Length

Max length: 24
Median length: 16
Mean length: 17.66101695
Min length: 6

Área
Categorical

Distinct count: 11
Unique (%): 9.3%
Missing: 0
Missing (%): 0.0%
Memory size: 944.0 B

Common values

Value | Count | Frequency (%)
Ciência | 22 | 18.6%
Química | 20 | 16.9%
Biologia | 18 | 15.3%
Física | 17 | 14.4%
História | 11 | 9.3%
Medicina | 10 | 8.5%
Astronomia | 7 | 5.9%
Matemática | 5 | 4.2%
Psicologia | 4 | 3.4%
Tecnologia | 2 | 1.7%

Length

Max length: 11
Median length: 7.5
Mean length: 7.711864407
Min length: 6

Mídia
Real number (ℝ≥0)

Distinct count: 6
Unique (%): 5.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.6101694915254237
Minimum: 1
Maximum: 7
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 2
95-th percentile: 4.15
Maximum: 7
Range: 6
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.226724485
Coefficient of variation (CV): 0.7618604697
Kurtosis: 6.409973895
Mean: 1.610169492
Median Absolute Deviation (MAD): 0
Skewness: 2.481564193
Sum: 190
Variance: 1.504852962

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
1 | 84 | 71.2%
2 | 16 | 13.6%
3 | 8 | 6.8%
5 | 4 | 3.4%
4 | 4 | 3.4%
7 | 2 | 1.7%

Minimum 5 values

Value | Count | Frequency (%)
1 | 84 | 71.2%
2 | 16 | 13.6%
3 | 8 | 6.8%
4 | 4 | 3.4%
5 | 4 | 3.4%

Maximum 5 values

Value | Count | Frequency (%)
7 | 2 | 1.7%
5 | 4 | 3.4%
4 | 4 | 3.4%
3 | 8 | 6.8%
2 | 16 | 13.6%
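
The summary statistics reported for Mídia can be re-derived from its frequency table alone. The sketch below is an illustration, not the code pandas-profiling runs: it rebuilds the 118 observations from the reported counts and assumes sample (n - 1) variance and linearly interpolated percentiles, which is what reproduces the reported figures. The helper name `quantile` is hypothetical.

```python
import statistics

# Value -> count, copied from the Mídia frequency table in this report.
midia_counts = {1: 84, 2: 16, 3: 8, 4: 4, 5: 4, 7: 2}

# Expand the table back into the sorted list of 118 observations.
values = sorted(v for v, count in midia_counts.items() for _ in range(count))


def quantile(sorted_values, q):
    """Percentile with linear interpolation between neighbouring order statistics."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (divides by n - 1)
std_dev = statistics.stdev(values)
cv = std_dev / mean                       # coefficient of variation
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])
iqr = quantile(values, 0.75) - quantile(values, 0.25)
p95 = quantile(values, 0.95)

print(f"n={len(values)} sum={sum(values)} mean={mean:.6f}")
print(f"variance={variance:.6f} std={std_dev:.6f} cv={cv:.6f}")
print(f"median={median} mad={mad} iqr={iqr} p95={p95:.2f}")
```

Because 84 of the 118 values equal the median, the MAD collapses to 0 even though the standard deviation is above 1; that is worth keeping in mind when reading the MAD of the other heavily skewed count variables.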
 

SEO
Boolean

Distinct count: 2
Unique (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 944.0 B

Value | Count | Frequency (%)
1 | 100 | 84.7%
0 | 18 | 15.3%
 

Links I.
Real number (ℝ≥0)

ZEROS

Distinct count: 7
Unique (%): 5.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.3813559322033898
Minimum: 0
Maximum: 7
Zeros: 98
Zeros (%): 83.1%
Memory size: 944.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 2
Maximum: 7
Range: 7
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1.108779968
Coefficient of variation (CV): 2.907467472
Kurtosis: 17.81086123
Mean: 0.3813559322
Median Absolute Deviation (MAD): 0
Skewness: 3.97935651
Sum: 45
Variance: 1.229393018

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0 | 98 | 83.1%
1 | 9 | 7.6%
2 | 6 | 5.1%
3 | 2 | 1.7%
7 | 1 | 0.8%
6 | 1 | 0.8%
5 | 1 | 0.8%

Minimum 5 values

Value | Count | Frequency (%)
0 | 98 | 83.1%
1 | 9 | 7.6%
2 | 6 | 5.1%
3 | 2 | 1.7%
5 | 1 | 0.8%

Maximum 5 values

Value | Count | Frequency (%)
7 | 1 | 0.8%
6 | 1 | 0.8%
5 | 1 | 0.8%
3 | 2 | 1.7%
2 | 6 | 5.1%
 

Links E.
Real number (ℝ≥0)

ZEROS

Distinct count: 24
Unique (%): 20.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.432203389830509
Minimum: 0
Maximum: 55
Zeros: 6
Zeros (%): 5.1%
Memory size: 944.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.85
Q1: 3
Median: 5
Q3: 8
95-th percentile: 23
Maximum: 55
Range: 55
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 8.564964226
Coefficient of variation (CV): 1.152412518
Kurtosis: 12.50842563
Mean: 7.43220339
Median Absolute Deviation (MAD): 3
Skewness: 3.187866256
Sum: 877
Variance: 73.3586122

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
5 | 15 | 12.7%
2 | 12 | 10.2%
3 | 12 | 10.2%
4 | 12 | 10.2%
1 | 8 | 6.8%
6 | 8 | 6.8%
7 | 8 | 6.8%
8 | 8 | 6.8%
0 | 6 | 5.1%
9 | 5 | 4.2%
Other values (14) | 24 | 20.3%

Minimum 5 values

Value | Count | Frequency (%)
0 | 6 | 5.1%
1 | 8 | 6.8%
2 | 12 | 10.2%
3 | 12 | 10.2%
4 | 12 | 10.2%

Maximum 5 values

Value | Count | Frequency (%)
55 | 1 | 0.8%
46 | 1 | 0.8%
41 | 1 | 0.8%
31 | 1 | 0.8%
28 | 1 | 0.8%
 

Complexidade
Categorical

Distinct count: 3
Unique (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 944.0 B

Common values

Value | Count | Frequency (%)
2 | 65 | 55.1%
1 | 36 | 30.5%
3 | 17 | 14.4%

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Introdução
Boolean

Distinct count: 2
Unique (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 944.0 B

Value | Count | Frequency (%)
1 | 106 | 89.8%
0 | 12 | 10.2%
 

Analogias
Boolean

Distinct count: 2
Unique (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 944.0 B

Value | Count | Frequency (%)
0 | 101 | 85.6%
1 | 17 | 14.4%
 

Interação
Boolean

Distinct count: 2
Unique (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 944.0 B

Value | Count | Frequency (%)
1 | 89 | 75.4%
0 | 29 | 24.6%
 

Siglas
Boolean

Distinct count: 2
Unique (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 944.0 B

Value | Count | Frequency (%)
1 | 95 | 80.5%
0 | 23 | 19.5%
 

Visualizações
Real number (ℝ≥0)

Distinct count: 115
Unique (%): 97.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 468.8220338983051
Minimum: 68
Maximum: 3044
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 68
5-th percentile: 90.55
Q1: 208.25
Median: 364
Q3: 561.75
95-th percentile: 1040
Maximum: 3044
Range: 2976
Interquartile range (IQR): 353.5

Descriptive statistics

Standard deviation: 466.6157963
Coefficient of variation (CV): 0.995294083
Kurtosis: 13.55855508
Mean: 468.8220339
Median Absolute Deviation (MAD): 179
Skewness: 3.299606831
Sum: 55321
Variance: 217730.3014

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
326 | 2 | 1.7%
370 | 2 | 1.7%
156 | 2 | 1.7%
510 | 1 | 0.8%
479 | 1 | 0.8%
332 | 1 | 0.8%
338 | 1 | 0.8%
84 | 1 | 0.8%
85 | 1 | 0.8%
88 | 1 | 0.8%
Other values (105) | 105 | 89.0%

Minimum 5 values

Value | Count | Frequency (%)
68 | 1 | 0.8%
74 | 1 | 0.8%
75 | 1 | 0.8%
84 | 1 | 0.8%
85 | 1 | 0.8%

Maximum 5 values

Value | Count | Frequency (%)
3044 | 1 | 0.8%
2696 | 1 | 0.8%
2215 | 1 | 0.8%
2172 | 1 | 0.8%
1198 | 1 | 0.8%
 

numPal
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 112
Unique (%): 94.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 702.0932203389831
Minimum: 203
Maximum: 1757
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 203
5-th percentile: 320.35
Q1: 435.5
Median: 625
Q3: 898.5
95-th percentile: 1311.25
Maximum: 1757
Range: 1554
Interquartile range (IQR): 463

Descriptive statistics

Standard deviation: 333.4175019
Coefficient of variation (CV): 0.4748906444
Kurtosis: 0.3199087018
Mean: 702.0932203
Median Absolute Deviation (MAD): 222
Skewness: 0.8913568873
Sum: 82847
Variance: 111167.2306

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
345 | 3 | 2.5%
678 | 2 | 1.7%
749 | 2 | 1.7%
400 | 2 | 1.7%
913 | 2 | 1.7%
322 | 1 | 0.8%
580 | 1 | 0.8%
583 | 1 | 0.8%
584 | 1 | 0.8%
330 | 1 | 0.8%
Other values (102) | 102 | 86.4%

Minimum 5 values

Value | Count | Frequency (%)
203 | 1 | 0.8%
236 | 1 | 0.8%
271 | 1 | 0.8%
292 | 1 | 0.8%
300 | 1 | 0.8%

Maximum 5 values

Value | Count | Frequency (%)
1757 | 1 | 0.8%
1702 | 1 | 0.8%
1452 | 1 | 0.8%
1441 | 1 | 0.8%
1341 | 1 | 0.8%
 

numPar
Real number (ℝ≥0)

Distinct count: 25
Unique (%): 21.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.177966101694915
Minimum: 6
Maximum: 46
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 6
5-th percentile: 7
Q1: 8
Median: 11
Q3: 17
95-th percentile: 25.3
Maximum: 46
Range: 40
Interquartile range (IQR): 9

Descriptive statistics

Standard deviation: 6.756612913
Coefficient of variation (CV): 0.5127204654
Kurtosis: 4.761099301
Mean: 13.1779661
Median Absolute Deviation (MAD): 4
Skewness: 1.820963879
Sum: 1555
Variance: 45.65181805

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
7 | 16 | 13.6%
11 | 15 | 12.7%
8 | 11 | 9.3%
10 | 11 | 9.3%
9 | 9 | 7.6%
15 | 6 | 5.1%
18 | 5 | 4.2%
17 | 5 | 4.2%
14 | 5 | 4.2%
6 | 5 | 4.2%
Other values (15) | 30 | 25.4%

Minimum 5 values

Value | Count | Frequency (%)
6 | 5 | 4.2%
7 | 16 | 13.6%
8 | 11 | 9.3%
9 | 9 | 7.6%
10 | 11 | 9.3%

Maximum 5 values

Value | Count | Frequency (%)
46 | 1 | 0.8%
35 | 1 | 0.8%
33 | 1 | 0.8%
28 | 2 | 1.7%
27 | 1 | 0.8%
 

numSub
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 96
Unique (%): 81.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 222.67796610169492
Minimum: 73
Maximum: 591
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 73
5-th percentile: 99.85
Q1: 139.25
Median: 201.5
Q3: 283
95-th percentile: 425.9
Maximum: 591
Range: 518
Interquartile range (IQR): 143.75

Descriptive statistics

Standard deviation: 107.270477
Coefficient of variation (CV): 0.4817291934
Kurtosis: 0.8315542167
Mean: 222.6779661
Median Absolute Deviation (MAD): 68
Skewness: 1.037321746
Sum: 26276
Variance: 11506.95524

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
172 | 3 | 2.5%
358 | 3 | 2.5%
182 | 3 | 2.5%
116 | 3 | 2.5%
103 | 3 | 2.5%
285 | 3 | 2.5%
169 | 2 | 1.7%
187 | 2 | 1.7%
230 | 2 | 1.7%
120 | 2 | 1.7%
Other values (86) | 92 | 78.0%

Minimum 5 values

Value | Count | Frequency (%)
73 | 1 | 0.8%
84 | 1 | 0.8%
85 | 1 | 0.8%
94 | 1 | 0.8%
98 | 1 | 0.8%

Maximum 5 values

Value | Count | Frequency (%)
591 | 1 | 0.8%
530 | 1 | 0.8%
494 | 1 | 0.8%
482 | 1 | 0.8%
477 | 1 | 0.8%
 

numAdj
Real number (ℝ≥0)

Distinct count: 58
Unique (%): 49.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 50.90677966101695
Minimum: 14
Maximum: 125
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 14
5-th percentile: 21.85
Q1: 33.25
Median: 45
Q3: 66
95-th percentile: 104
Maximum: 125
Range: 111
Interquartile range (IQR): 32.75

Descriptive statistics

Standard deviation: 24.02562282
Coefficient of variation (CV): 0.4719533033
Kurtosis: 0.5929379555
Mean: 50.90677966
Median Absolute Deviation (MAD): 17.5
Skewness: 0.924089666
Sum: 6007
Variance: 577.2305519

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
26 | 7 | 5.9%
61 | 5 | 4.2%
38 | 5 | 4.2%
40 | 5 | 4.2%
36 | 5 | 4.2%
67 | 4 | 3.4%
34 | 4 | 3.4%
71 | 4 | 3.4%
65 | 3 | 2.5%
69 | 3 | 2.5%
Other values (48) | 73 | 61.9%

Minimum 5 values

Value | Count | Frequency (%)
14 | 1 | 0.8%
19 | 2 | 1.7%
20 | 2 | 1.7%
21 | 1 | 0.8%
22 | 2 | 1.7%

Maximum 5 values

Value | Count | Frequency (%)
125 | 1 | 0.8%
122 | 1 | 0.8%
116 | 1 | 0.8%
108 | 1 | 0.8%
104 | 3 | 2.5%
 

numVrb
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 85
Unique (%): 72.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 97.0
Minimum: 24
Maximum: 287
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 24
5-th percentile: 37.85
Q1: 60.25
Median: 87.5
Q3: 127.5
95-th percentile: 186.45
Maximum: 287
Range: 263
Interquartile range (IQR): 67.25

Descriptive statistics

Standard deviation: 49.13734458
Coefficient of variation (CV): 0.5065705627
Kurtosis: 1.116462357
Mean: 97
Median Absolute Deviation (MAD): 30.5
Skewness: 1.044007179
Sum: 11446
Variance: 2414.478632

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
64 | 5 | 4.2%
76 | 4 | 3.4%
168 | 4 | 3.4%
65 | 3 | 2.5%
128 | 3 | 2.5%
94 | 3 | 2.5%
60 | 2 | 1.7%
59 | 2 | 1.7%
48 | 2 | 1.7%
44 | 2 | 1.7%
Other values (75) | 88 | 74.6%

Minimum 5 values

Value | Count | Frequency (%)
24 | 1 | 0.8%
31 | 1 | 0.8%
32 | 2 | 1.7%
35 | 1 | 0.8%
37 | 1 | 0.8%

Maximum 5 values

Value | Count | Frequency (%)
287 | 1 | 0.8%
229 | 1 | 0.8%
205 | 1 | 0.8%
204 | 1 | 0.8%
198 | 1 | 0.8%
 

numNEs
Real number (ℝ≥0)

Distinct count: 71
Unique (%): 60.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 47.36440677966102
Minimum: 3
Maximum: 269
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 3
5-th percentile: 7
Q1: 14
Median: 29.5
Q3: 64.75
95-th percentile: 124.5
Maximum: 269
Range: 266
Interquartile range (IQR): 50.75

Descriptive statistics

Standard deviation: 46.93998082
Coefficient of variation (CV): 0.991039137
Kurtosis: 5.488260223
Mean: 47.36440678
Median Absolute Deviation (MAD): 18.5
Skewness: 2.0540532
Sum: 5589
Variance: 2203.361799

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
13 | 7 | 5.9%
11 | 6 | 5.1%
8 | 4 | 3.4%
92 | 3 | 2.5%
34 | 3 | 2.5%
26 | 3 | 2.5%
21 | 3 | 2.5%
19 | 3 | 2.5%
61 | 3 | 2.5%
14 | 3 | 2.5%
Other values (61) | 80 | 67.8%

Minimum 5 values

Value | Count | Frequency (%)
3 | 1 | 0.8%
4 | 1 | 0.8%
5 | 1 | 0.8%
6 | 2 | 1.7%
7 | 3 | 2.5%

Maximum 5 values

Value | Count | Frequency (%)
269 | 1 | 0.8%
228 | 1 | 0.8%
206 | 1 | 0.8%
146 | 1 | 0.8%
134 | 1 | 0.8%
 

numDet
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 71
Unique (%): 60.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 91.65254237288136
Minimum: 25
Maximum: 201
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 25
5-th percentile: 38.85
Q1: 57
Median: 82.5
Q3: 116
95-th percentile: 176.15
Maximum: 201
Range: 176
Interquartile range (IQR): 59

Descriptive statistics

Standard deviation: 43.55375441
Coefficient of variation (CV): 0.4752050874
Kurtosis: -0.4048510702
Mean: 91.65254237
Median Absolute Deviation (MAD): 31
Skewness: 0.7351832424
Sum: 10815
Variance: 1896.929523

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
51 | 5 | 4.2%
78 | 5 | 4.2%
177 | 4 | 3.4%
45 | 4 | 3.4%
74 | 4 | 3.4%
101 | 3 | 2.5%
175 | 3 | 2.5%
62 | 3 | 2.5%
83 | 3 | 2.5%
37 | 3 | 2.5%
Other values (61) | 81 | 68.6%

Minimum 5 values

Value | Count | Frequency (%)
25 | 1 | 0.8%
35 | 1 | 0.8%
37 | 3 | 2.5%
38 | 1 | 0.8%
39 | 2 | 1.7%

Maximum 5 values

Value | Count | Frequency (%)
201 | 2 | 1.7%
177 | 4 | 3.4%
176 | 1 | 0.8%
175 | 3 | 2.5%
170 | 1 | 0.8%
 

numConj
Real number (ℝ≥0)

Distinct count: 43
Unique (%): 36.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 24.466101694915253
Minimum: 4
Maximum: 76
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 4
5-th percentile: 8
Q1: 15
Median: 20
Q3: 31.75
95-th percentile: 51.75
Maximum: 76
Range: 72
Interquartile range (IQR): 16.75

Descriptive statistics

Standard deviation: 14.2329113
Coefficient of variation (CV): 0.5817400533
Kurtosis: 1.671174383
Mean: 24.46610169
Median Absolute Deviation (MAD): 7
Skewness: 1.304146207
Sum: 2887
Variance: 202.5757642

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
14 | 7 | 5.9%
15 | 7 | 5.9%
24 | 6 | 5.1%
19 | 6 | 5.1%
34 | 6 | 5.1%
17 | 5 | 4.2%
18 | 5 | 4.2%
16 | 5 | 4.2%
23 | 5 | 4.2%
9 | 5 | 4.2%
Other values (33) | 61 | 51.7%

Minimum 5 values

Value | Count | Frequency (%)
4 | 1 | 0.8%
7 | 4 | 3.4%
8 | 2 | 1.7%
9 | 5 | 4.2%
11 | 3 | 2.5%

Maximum 5 values

Value | Count | Frequency (%)
76 | 1 | 0.8%
69 | 1 | 0.8%
66 | 1 | 0.8%
59 | 1 | 0.8%
57 | 1 | 0.8%
 

numAdv
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 60
Unique (%): 50.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 41.33050847457627
Minimum: 6
Maximum: 127
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 6
5-th percentile: 12.85
Q1: 22
Median: 37
Q3: 52
95-th percentile: 84.35
Maximum: 127
Range: 121
Interquartile range (IQR): 30

Descriptive statistics

Standard deviation: 24.14417347
Coefficient of variation (CV): 0.5841731535
Kurtosis: 1.029176415
Mean: 41.33050847
Median Absolute Deviation (MAD): 15
Skewness: 1.055467197
Sum: 4877
Variance: 582.9411126

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
26 | 5 | 4.2%
18 | 5 | 4.2%
36 | 5 | 4.2%
49 | 4 | 3.4%
27 | 4 | 3.4%
21 | 4 | 3.4%
20 | 3 | 2.5%
52 | 3 | 2.5%
44 | 3 | 2.5%
42 | 3 | 2.5%
Other values (50) | 79 | 66.9%

Minimum 5 values

Value | Count | Frequency (%)
6 | 1 | 0.8%
7 | 1 | 0.8%
10 | 2 | 1.7%
11 | 1 | 0.8%
12 | 1 | 0.8%

Maximum 5 values

Value | Count | Frequency (%)
127 | 1 | 0.8%
107 | 1 | 0.8%
102 | 3 | 2.5%
92 | 1 | 0.8%
83 | 1 | 0.8%
 

numAdp
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 82
Unique (%): 69.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 106.59322033898304
Minimum: 25
Maximum: 269
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 25
5-th percentile: 44
Q1: 65.5
Median: 95
Q3: 135.25
95-th percentile: 208.65
Maximum: 269
Range: 244
Interquartile range (IQR): 69.75

Descriptive statistics

Standard deviation: 53.49434626
Coefficient of variation (CV): 0.5018550532
Kurtosis: 0.5212102647
Mean: 106.5932203
Median Absolute Deviation (MAD): 36.5
Skewness: 0.9682661123
Sum: 12578
Variance: 2861.645082

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
49 | 4 | 3.4%
69 | 4 | 3.4%
86 | 4 | 3.4%
42 | 3 | 2.5%
44 | 3 | 2.5%
71 | 3 | 2.5%
58 | 3 | 2.5%
115 | 3 | 2.5%
53 | 3 | 2.5%
95 | 3 | 2.5%
Other values (72) | 85 | 72.0%

Minimum 5 values

Value | Count | Frequency (%)
25 | 1 | 0.8%
35 | 1 | 0.8%
42 | 3 | 2.5%
44 | 3 | 2.5%
45 | 1 | 0.8%

Maximum 5 values

Value | Count | Frequency (%)
269 | 1 | 0.8%
266 | 1 | 0.8%
256 | 1 | 0.8%
222 | 1 | 0.8%
221 | 1 | 0.8%
 

numNum
Real number (ℝ≥0)

ZEROS

Distinct count: 33
Unique (%): 28.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.313559322033898
Minimum: 0
Maximum: 70
Zeros: 8
Zeros (%): 6.8%
Memory size: 944.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 3
Median: 8
Q3: 12.75
95-th percentile: 37.6
Maximum: 70
Range: 70
Interquartile range (IQR): 9.75

Descriptive statistics

Standard deviation: 12.94610733
Coefficient of variation (CV): 1.144300123
Kurtosis: 5.236466333
Mean: 11.31355932
Median Absolute Deviation (MAD): 5
Skewness: 2.185885602
Sum: 1335
Variance: 167.6016949

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
2 | 11 | 9.3%
7 | 11 | 9.3%
1 | 10 | 8.5%
0 | 8 | 6.8%
10 | 8 | 6.8%
9 | 7 | 5.9%
8 | 7 | 5.9%
3 | 5 | 4.2%
5 | 5 | 4.2%
12 | 5 | 4.2%
Other values (23) | 41 | 34.7%

Minimum 5 values

Value | Count | Frequency (%)
0 | 8 | 6.8%
1 | 10 | 8.5%
2 | 11 | 9.3%
3 | 5 | 4.2%
4 | 4 | 3.4%

Maximum 5 values

Value | Count | Frequency (%)
70 | 1 | 0.8%
57 | 1 | 0.8%
52 | 1 | 0.8%
51 | 1 | 0.8%
47 | 1 | 0.8%
 

Pergunta
Boolean

Distinct count: 2
Unique (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 944.0 B

Value | Count | Frequency (%)
0 | 73 | 61.9%
1 | 45 | 38.1%
 

tamParagraf
Real number (ℝ≥0)

Distinct count: 109
Unique (%): 92.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 420.6525423728813
Minimum: 171
Maximum: 788
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 171
5-th percentile: 217.1
Q1: 313.75
Median: 424.5
Q3: 512
95-th percentile: 646.5
Maximum: 788
Range: 617
Interquartile range (IQR): 198.25

Descriptive statistics

Standard deviation: 132.8897993
Coefficient of variation (CV): 0.315913458
Kurtosis: -0.4869074441
Mean: 420.6525424
Median Absolute Deviation (MAD): 104.5
Skewness: 0.2301807335
Sum: 49637
Variance: 17659.69875

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
460 | 4 | 3.4%
358 | 2 | 1.7%
342 | 2 | 1.7%
476 | 2 | 1.7%
294 | 2 | 1.7%
240 | 2 | 1.7%
472 | 2 | 1.7%
337 | 1 | 0.8%
340 | 1 | 0.8%
431 | 1 | 0.8%
Other values (99) | 99 | 83.9%

Minimum 5 values

Value | Count | Frequency (%)
171 | 1 | 0.8%
180 | 1 | 0.8%
183 | 1 | 0.8%
184 | 1 | 0.8%
204 | 1 | 0.8%

Maximum 5 values

Value | Count | Frequency (%)
788 | 1 | 0.8%
689 | 1 | 0.8%
685 | 1 | 0.8%
684 | 1 | 0.8%
672 | 1 | 0.8%
 

tamTitulo
Real number (ℝ≥0)

Distinct count: 54
Unique (%): 45.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 40.83898305084746
Minimum: 7
Maximum: 91
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 7
5-th percentile: 19.7
Q1: 27
Median: 38
Q3: 52
95-th percentile: 71
Maximum: 91
Range: 84
Interquartile range (IQR): 25

Descriptive statistics

Standard deviation: 16.94383398
Coefficient of variation (CV): 0.4148936315
Kurtosis: -0.1966609373
Mean: 40.83898305
Median Absolute Deviation (MAD): 12.5
Skewness: 0.4902576466
Sum: 4819
Variance: 287.0935101

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
37 | 6 | 5.1%
52 | 5 | 4.2%
22 | 5 | 4.2%
40 | 5 | 4.2%
36 | 5 | 4.2%
45 | 5 | 4.2%
42 | 4 | 3.4%
25 | 4 | 3.4%
27 | 4 | 3.4%
31 | 3 | 2.5%
Other values (44) | 72 | 61.0%

Minimum 5 values

Value | Count | Frequency (%)
7 | 2 | 1.7%
11 | 1 | 0.8%
14 | 1 | 0.8%
15 | 1 | 0.8%
18 | 1 | 0.8%

Maximum 5 values

Value | Count | Frequency (%)
91 | 1 | 0.8%
78 | 2 | 1.7%
74 | 2 | 1.7%
71 | 2 | 1.7%
70 | 1 | 0.8%
 

Dias
Real number (ℝ≥0)

Distinct count: 117
Unique (%): 99.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 353.7881355932203
Minimum: 6
Maximum: 634
Zeros: 0
Zeros (%): 0.0%
Memory size: 944.0 B

Quantile statistics

Minimum: 6
5-th percentile: 46.95
Q1: 196.75
Median: 363
Q3: 527
95-th percentile: 589.45
Maximum: 634
Range: 628
Interquartile range (IQR): 330.25

Descriptive statistics

Standard deviation: 187.040233
Coefficient of variation (CV): 0.5286786473
Kurtosis: -1.257968282
Mean: 353.7881356
Median Absolute Deviation (MAD): 166.5
Skewness: -0.2557774436
Sum: 41747
Variance: 34984.04875

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
426 | 2 | 1.7%
625 | 1 | 0.8%
330 | 1 | 0.8%
573 | 1 | 0.8%
318 | 1 | 0.8%
575 | 1 | 0.8%
321 | 1 | 0.8%
578 | 1 | 0.8%
580 | 1 | 0.8%
69 | 1 | 0.8%
Other values (107) | 107 | 90.7%

Minimum 5 values

Value | Count | Frequency (%)
6 | 1 | 0.8%
13 | 1 | 0.8%
20 | 1 | 0.8%
27 | 1 | 0.8%
34 | 1 | 0.8%

Maximum 5 values

Value | Count | Frequency (%)
634 | 1 | 0.8%
626 | 1 | 0.8%
625 | 1 | 0.8%
622 | 1 | 0.8%
594 | 1 | 0.8%
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for an exactly linear relationship the steepness of the line (its angle to the x-axis) does not change the magnitude of r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
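
As an illustration of the formula just described, the following sketch computes r in plain Python. It is not the code pandas-profiling runs; the function name `pearson_r` and the sample lists are hypothetical.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    # so plain sums of products are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / (spread_x * spread_y)


# Hypothetical data: word counts and noun counts for five posts.
words = [200, 400, 600, 800, 1000]
nouns = [70, 130, 180, 260, 310]

r = pearson_r(words, nouns)
# A change of location and (positive) scale in one variable leaves r unchanged.
r_rescaled = pearson_r(words, [2.5 * v + 10 for v in nouns])
print(round(r, 4), round(r_rescaled, 4))
```

The rescaling check mirrors the invariance property stated above: only the strength and direction of the linear relationship matter, not the units.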

Spearman's ρ

Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
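
A minimal sketch of that recipe follows: rank each variable, then apply the Pearson formula to the ranks. Giving tied values the average of their positions is an assumption made here (the text above does not say how ties are handled), and the names `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of the positions they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of equal values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def spearman_rho(x, y):
    """Pearson's r computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Hypothetical data: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [1, 4, 9, 16, 25]
print(spearman_rho(x, y), pearson_r(x, y))
```

On this example ρ reaches 1 while r stays below 1, which is the difference between monotonic and linear association described above.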

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
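
The pair-counting definition above can be sketched directly. This is the plain variant (often called tau-a), which divides by the total number of pairs and makes no adjustment for ties; statistical libraries commonly report a tie-adjusted variant instead, so results may differ on data with many repeated values. The function name `kendall_tau_a` and the sample data are hypothetical.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1      # both variables move in the same direction
        elif direction < 0:
            discordant += 1      # the variables move in opposite directions
        # direction == 0 means a tie in x or y: counted in neither group
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs


# Hypothetical data: one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print(tau)
```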

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient when the input follows a bivariate normal distribution. Extensive documentation is available from the Phik project.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. This report uses the bias-corrected measure proposed by Bergsma in 2013.
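
The sketch below shows one way to compute a bias-corrected Cramér's V from a contingency table, following the correction usually attributed to Bergsma (2013): the chi-squared based φ² and the table dimensions are each shrunk by a term in 1/(n - 1). It is an illustration under that assumption, not the report's own code; the function name `cramers_v_corrected` and the example tables are hypothetical.

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    r, k = len(row_totals), len(col_totals)
    if n < 2:
        raise ValueError("at least two observations are required")

    # Pearson chi-squared statistic from observed vs. expected counts.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected

    phi2 = chi2 / n
    # Bias correction: shrink phi^2 and the effective table dimensions.
    phi2_corrected = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corrected = r - (r - 1) ** 2 / (n - 1)
    k_corrected = k - (k - 1) ** 2 / (n - 1)
    denominator = min(k_corrected - 1, r_corrected - 1)
    if denominator <= 0:
        return 0.0  # a single row or column carries no association
    return math.sqrt(phi2_corrected / denominator)


# Hypothetical 2x2 tables: perfect association and exact independence.
perfect = [[10, 0], [0, 10]]
independent = [[5, 5], [5, 5]]
print(cramers_v_corrected(perfect), cramers_v_corrected(independent))
```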

Missing values

Sample

First rows

Post | Categoria | Área | Mídia | SEO | Links I. | Links E. | Complexidade | Introdução | Analogias | Interação | Siglas | Visualizações | numPal | numPar | numSub | numAdj | numVrb | numNEs | numDet | numConj | numAdv | numAdp | numNum | Pergunta | tamParagraf | tamTitulo | Dias
0Livro – Professor, para que estudo isso?ABC da ciênciaCiência1002110113863459137304147391115494130840634
1Coleção Contém QuímicaABC da ciênciaQuímica21031101020953218223376079662313848020422626
2O que é um podcast? É tipo rádio? Como faço para começar a ouvir?OutrosTecnologia71028211107639133332242152109118295313216118065625
3Nojo no mundo animalSci… what?Biologia1103211103263226120204320371410584051420622
4Prêmio Nobel 2017 – Microscopia Eletrônica com CriogeniaSci… what?Química1100300012665571322740558669182010714034056594
5Que seja eterno enquanto dure…Sci… what?Biologia1108211102074009116255327517215817033232592
6O que dá cor aos fogos de artifício?Ciência ao redorQuímica1103310118274007142265621511515650147236589
7Por que café descafeinado tem gosto e aroma de café?O que que a ciência tem?Química1107300015833457103145411511216567138452587
8Arco-íris de sons? O que seria?O que que a ciência tem?Física1000311106865779185258613931535950147831585
9[ENTREVISTA] Água em Marte: os próximos passos para a pesquisa espacialProfissão CientistaAstronomia2101210104421129203827014012314922721847037871584

Last rows

Post | Categoria | Área | Mídia | SEO | Links I. | Links E. | Complexidade | Introdução | Analogias | Interação | Siglas | Visualizações | numPal | numPar | numSub | numAdj | numVrb | numNEs | numDet | numConj | numAdv | numAdp | numNum | Pergunta | tamParagraf | tamTitulo | Dias
108Por que aquele saboroso cafezinho espanta o nosso sono?Ciência ao redorQuímica1002210111565641217334761488939861013215562
10910 momentos da ciência em 2019Ciência ao redorCiência315412111088121927383851686413737661845102933061
110O que são explicações exploráveis?O que que a ciência tem?Tecnologia1101321011754371010736801053203651113423455
111A percepção dos brasileiros sobre a ciênciaOutrosCiência1121311010132101111246871685011549631314706894348
112A química dos saboresO que que a ciência tem?Química31032101120898717285821423214545471331204082141
113As especiarias e os aromasO que que a ciência tem?Química21010200011075631118267642986322469404162634
114Existem infinitos maiores que outros?O que que a ciência tem?Matemática110821111857882818772123141022652795711713727
115Seres BioluminescentesCiência ao redorBiologia11062101196652919561104778154493006152220
116O que tem a ver um tear com a era dos computadores?Profissão CientistaHistória2115210119960810182358836981936891314605113
117O Paradoxo de Simpson te mostra que nem tudo é o que pareceO que que a ciência tem?Matemática110421011847592120246104351001941100700234596